A Computer-based Articulation Training Aid for Short Words (CATA)

Authors

  • Mukund Devarajan
  • Stephen A. Zahorian
  • Vijayan K. Asari
Abstract

Mukund Devarajan, Old Dominion University, 2003. Director: Dr. Stephen A. Zahorian.

Several improvements to the vowel articulation training aid (VATA) are described, along with efforts to extend the visual feedback system to operate with short words of the form consonant-vowel-consonant (CVC). The extended version of the visual feedback system is referred to as CATA (Computer-based Articulation Training Aid); the vowel version of the aid (VATA) operates only with ten American English monophthong vowels. Improvements to VATA include the use of a neural network (NN) recognizer to prune a large database of vowel recordings, eliminating noisy and/or mispronounced tokens. The spectral jitter problem previously present in VATA has also been corrected. Initial steps in the development of CATA involved database preparation. The training methodologies and the step-by-step procedure for using hidden Markov modeling (HMM) to recognize and segment a CVC database are described. The signal processing and recognition steps involved in building a real-time display system that provides visual feedback about the quality of pronunciation of the CVCs are described in detail. An attempt at using a time-delay neural network (TDNN) classifier to distinguish the phonemes present in the CVCs is also described. Experiments conducted to improve VATA and the initial results obtained with the CVC display system are reported.
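The recognizer-based pruning mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the recognizer's architecture, features, or rejection rule, so everything here is an assumption: a nearest-centroid classifier stands in for the neural network, the two-dimensional feature vectors are toy data, and the names `centroids`, `classify`, `prune_tokens`, and `min_confidence` are hypothetical. The idea shown is only that a token is kept when the recognizer agrees with its label with sufficient confidence.

```python
import math
from collections import defaultdict


def centroids(tokens):
    """Mean feature vector per label (a stand-in for a trained NN recognizer)."""
    sums, counts = {}, defaultdict(int)
    for features, label in tokens:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {label: [s / counts[label] for s in vec] for label, vec in sums.items()}


def classify(features, cents):
    """Return (best_label, confidence) via a softmax over negative squared distances."""
    scores = {
        label: -sum((f - c) ** 2 for f, c in zip(features, cent))
        for label, cent in cents.items()
    }
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {label: math.exp(s - top) for label, s in scores.items()}
    total = sum(exps.values())
    best = max(exps, key=exps.get)
    return best, exps[best] / total


def prune_tokens(tokens, min_confidence=0.6):
    """Keep tokens whose label matches the recognizer output with enough confidence."""
    cents = centroids(tokens)
    kept = []
    for features, label in tokens:
        predicted, confidence = classify(features, cents)
        if predicted == label and confidence >= min_confidence:
            kept.append((features, label))
    return kept


# Toy data (assumed): two vowel classes plus one token that looks mislabeled.
tokens = [
    ([0.0, 0.0], "iy"), ([0.1, 0.0], "iy"), ([0.0, 0.1], "iy"),
    ([2.0, 2.0], "aa"), ([2.1, 2.0], "aa"), ([2.0, 2.1], "aa"),
    ([2.0, 1.9], "iy"),  # sits in the /aa/ cluster but is labeled /iy/
]
kept = prune_tokens(tokens)
print(len(tokens), "tokens before pruning,", len(kept), "after")
```

In practice the threshold trades database size against cleanliness, and a real system would score tokens with a recognizer trained on data other than the tokens being judged.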

Similar resources

HMM-neural network monophone models for computer-based articulation training for the hearing impaired

A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. In previous papers, the signal processing steps and display options have been described for giving real-time feedback about the quality of pronunciation for 10 steady-state American English monophthong vowels (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /u...

Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes

This paper describes how seven French subjects change their pronunciation and articulation when practising Swedish words with a computer-animated virtual teacher. The teacher gives feedback on the user’s pronunciation with audiovisual instructions suggesting how the articulation should be changed. A wizard-of-Oz set-up was used for the training session, in which a human listener chose the adeq...

Discriminative and Maximum Likelihood Classifiers for Computer-Based Visual Feedback for Speech Training for the Hearing Impaired

A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. The training aid provides real-time visual feedback as to the quality of pronunciation for 10 steady-state American English monophthong vowel phonemes (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This training aid is thus referred to as a Vowel Arti...

Vowel classification for computer-based visual feedback for speech training for the hearing impaired

A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. The training aid provides real-time visual feedback as to the quality of pronunciation for 10 steady-state American English monophthong vowels (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This training aid is thus referred to as a Vowel Articulation...

Personal computer software vowel training aid for the hearing impaired

A vowel training aid system for hearing-impaired persons, which uses a Windows-based multimedia computer, has been developed. The system provides two main displays that give visual feedback for vowels spoken in isolation and in short-word contexts. Feature extraction methods and neural network processing techniques provide a high degree of accuracy for speaker-independent vowel training. The system...

Publication date: 2004